Tree-based recursive partitioning methods for subdividing sibpairs into relatively more homogeneous subgroups.
نویسندگان
چکیده
We propose a new splitting rule for recursively partitioning sibpair data into relatively more homogeneous subgroups. This strategy is designed to identify subgroups of sibpairs such that within-subgroup analyses result in increased power to detect linkage using Haseman-Elston regression. We assume that the subgroups can be defined by patterns of non-genetic binary covariates measured on each sibpair. The data we consider consists of the squared difference of a quantitative trait measurement on each sibpair, estimates of identity-by-descent (IBD) values at each genetic marker, and binary covariate data describing characteristics of the sibpair (e.g., race, sex, family history of disease). To test the efficacy of this method in linkage analysis, we performed two simulation experiments. In the first, we simulated a mixture consisting of 66.6% of the sibpairs with no linkage and 33.3% of the sibpairs with genetic linkage to one marker. The two groups were distinguished by the value of a single binary covariate. We also simulated one unlinked marker and one random covariate to include as noise in the data. In the second experiment, we simulated a mixture consisting of 55% of the sibpairs with no genetic linkage, 22.5% of the sibpairs with genetic linkage to one marker, and 22.5% of the sibpairs with linkage to a different marker. Each subgroup was defined by a distinct pattern of two binary covariates. We also simulated one unlinked marker and two random covariates to include as noise in the data. Our simulation studies found that we can significantly increase the overall power to detect linkage by fitting Haseman-Elston regression models to homogeneous subgroups with only a small increase in the false-positive rate. Second, the splitting rule can correctly identify important covariates and linked markers. Third, recursive partitioning of sibpair data using this splitting rule can correctly identify sibpair subgroups. These results indicate that partitioning sibpairs into homogeneous subgroups is feasible and significantly increases the power to detect linkage, thus demonstrating the practical utility and potential this new methodology holds.
منابع مشابه
Decision trees in epidemiological research
BACKGROUND In many studies, it is of interest to identify population subgroups that are relatively homogeneous with respect to an outcome. The nature of these subgroups can provide insight into effect mechanisms and suggest targets for tailored interventions. However, identifying relevant subgroups can be challenging with standard statistical methods. MAIN TEXT We review the literature on dec...
متن کاملارزیابی متغیرهای پیشآگهی در ردهبندی نرخ بقای بیماران مبتلا به سرطان کولورکتال با استفاده از درخت تصمیم
Background ; Objectives: Identifying the important influential factors is a great challenge in oncology studies. Decision tree is one of methods that could be used to evaluate the prognostic factors and classifying the patients' homogeneously. This method identifies the main prognostic factors and then determines the subgroups of patients based on those prognostic factors. The aim of this...
متن کاملAneurysmal subarachnoid hemorrhage prognostic decision-making algorithm using classification and regression tree analysis
BACKGROUND Classification and regression tree analysis involves the creation of a decision tree by recursive partitioning of a dataset into more homogeneous subgroups. Thus far, there is scarce literature on using this technique to create clinical prediction tools for aneurysmal subarachnoid hemorrhage (SAH). METHODS The classification and regression tree analysis technique was applied to the...
متن کاملOriginal Contribution Applying Recursive Partitioning to a Prospective Study of Factors Associated with Adherence to Mammography Screening Guidelines
Although a number of predictors of adherence to mammography screening guidelines have been identified using traditional statistical methods, many women are not screening according to these guidelines. Recursive partitioning may aid in developing novel intervention strategies to promote this screening behavior by identifying subgroups of women that differ on adherence across predictor variables....
متن کاملApplication of Survival Tree Model in Determining Affecting Factors in Breastfeeding Duration
Background and Purpose: Survival tree model is a nonparametric method which can be used to identify the affecting factors from a specific time to the onset of an event. In this method, the categories are selected according to the most important factors. The purpose of this study was to determine the factors affecting the duration of breastfeeding in mothers and introduce the homogeneous subgrou...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید
ثبت ناماگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید
ورودعنوان ژورنال:
- Genetic epidemiology
دوره 20 3 شماره
صفحات -
تاریخ انتشار 2001